SEQUENCE LISTING



(1) GENERAL INFORMATION 

(i) APPLICANT: Lal, Preeti

Bandman, Olga 

(ii) TITLE OF THE INVENTION: NOVEL HUMAN SODIUM-DEPENDENT 

PHOSPHATE CO-TRANSPORTER 

(iii) NUMBER OF SEQUENCES: 7 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

(B) STREET: 3174 Porter Drive 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: US 

(F) ZIP: 94304 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Diskette

(B) COMPUTER: IBM Compatible

(C) OPERATING SYSTEM: DOS

(D) SOFTWARE: FastSEQ Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: Filed Herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Billings, Lucy J.
(B) REGISTRATION NUMBER: 36,749
(C) REFERENCE/DOCKET NUMBER: PF-0221 US

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 415-855-0555
(B) TELEFAX: 415-845-4166



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

Met Gln Val Asp Glu Thr Leu Ile Pro Arg Lys Val Pro Ser Leu Cys
1 5 10 15

Ser Ala Arg Tyr Gly Ile Ala Leu Val Leu His Phe Cys Asn Phe Thr
20 25 30

Thr Ile Ala Gln Asn Val Ile Met Asn Ile Thr Met Val Ala Met Val
35 40 45

Asn Ser Thr Ser Pro Gln Ser Gln Leu Asn Asp Ser Ser Glu Val Leu
50 55 60

Pro Val Asp Ser Phe Gly Gly Leu Ser Lys Ala Pro Lys Ser Leu Pro
65 70 75 80

Ala Lys Ser Ser Ile Leu Gly Gly Gln Phe Ala Ile Trp Glu Arg Trp
85 90 95

Gly Pro Pro Gln Glu Arg Ser Arg Leu Cys Ser Ile Ala Leu Ser Gly
100 105 110

Met Leu Leu Gly Cys Phe Thr Ala Ile Leu Ile Gly Gly Phe Ile Ser
115 120 125

Glu Thr Leu Gly Trp Pro Phe Val Phe Tyr Ile Phe Gly Gly Val Gly
130 135 140

Cys Val Cys Cys Leu Leu Trp Phe Val Val Ile Tyr Asp Asp Pro Val
145 150 155 160

Ser Tyr Pro Trp Ile Ser Thr Ser Glu Lys Glu Tyr Ile Ile Ser Ser
165 170 175

Leu Lys Gln Gln Val Gly Ser Ser Lys Gln Pro Leu Pro Ile Lys Ala
180 185 190

Met Leu Arg Ser Leu Pro Ile Trp Ser Ile Cys Leu Gly Cys Phe Ser
195 200 205

His Gln Trp Leu Val Ser Thr Met Val Val Tyr Ile Pro Thr Tyr Ile
210 215 220

Ser Ser Val Tyr His Val Asn Ile Arg Asp Asn Gly Leu Leu Ser Ala
225 230 235 240

Leu Pro Phe Ile Val Ala Trp Val Ile Gly Met Val Gly Gly Tyr Leu
245 250 255

Ala Asp Phe Leu Leu Thr Lys Lys Phe Arg Leu Ile Thr Val Arg Lys
260 265 270

Ile Ala Thr Ile Leu Gly Ser Leu Pro Ser Ser Ala Leu Ile Val Ser
275 280 285

Leu Pro Tyr Leu Asn Ser Gly Tyr Ile Thr Ala Thr Ala Leu Leu Thr
290 295 300

Leu Ser Cys Gly Leu Ser Thr Leu Cys Gln Ser Gly Ile Tyr Ile Asn
305 310 315 320

Val Leu Asp Ile Ala Pro Arg Tyr Ser Ser Phe Leu Met Gly Ala Ser
325 330 335

Arg Gly Phe Ser Ser Ile Ala Pro Val Ile Val Pro Thr Val Ser Gly
340 345 350

Phe Leu Leu Ser Gln Asp Pro Glu Phe Gly Trp Arg Asn Val Phe Phe
355 360 365

Leu Leu Phe Ala Val Asn Leu Leu Gly Leu Leu Phe Tyr Leu Ile Phe
370 375 380

Gly Glu Ala Asp Val Gln Glu Trp Ala Lys Glu Arg Lys Leu Thr Arg
385 390 395 400

Leu



(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:



Met Gln Val Asp Glu Thr Leu Ile Pro Arg Lys Val Pro Ser Leu Cys
1 5 10 15

Ser Ala Arg Tyr Gly Ile Ala Leu Val Leu His Phe Cys Asn Phe Thr
20 25 30

Thr Ile Ala Gln Asn Val Ile Met Asn Ile Thr Met Val Ala Met Val
35 40 45

Asn Ser Thr Ser Pro Gln Ser Gln Leu Asn Asp Ser Ser Glu Val Leu
50 55 60

Pro Val Asp Ser Phe Gly Gly Leu Ser Lys Ala Pro Lys Ser Leu Pro
65 70 75 80

Ala Lys Ser Ser Ile Leu Gly Gly Gln Phe Ala Ile Trp Glu Arg Trp
85 90 95

Gly Pro Pro Gln Glu Arg Ser Arg Leu Cys Ser Ile Ala Leu Ser Gly
100 105 110

Met Leu Leu Gly Cys Phe Thr Ala Ile Leu Ile Gly Gly Phe Ile Ser
115 120 125

Glu Thr Leu Gly Trp Pro Phe Val Phe Tyr Ile Phe Gly Gly Val Gly
130 135 140

Cys Val Cys Cys Leu Leu Trp Phe Val Val Ile Tyr Asp Asp Pro Val
145 150 155 160

Ser Tyr Pro Trp Ile Ser Thr Ser Glu Lys Glu Tyr Ile Ile Ser Ser
165 170 175

Leu Lys Gln Gln Val Gly Ser Ser Lys Gln Pro Leu Pro Ile Lys Ala
180 185 190

Met Leu Arg Ser Leu Pro Ile Trp Ser Ile Cys Leu Gly Cys Phe Ser
195 200 205

His Gln Trp Leu Val Ser Thr Met Val Val Tyr Ile Pro Thr Tyr Ile
210 215 220

Ser Ser Val Tyr His Val Asn Ile Arg Asp Asn Gly Leu Leu Ser Ala
225 230 235 240

Leu Pro Phe Ile Val Ala Trp Val Ile Gly Met Val Gly Gly Tyr Leu
245 250 255

Ala Asp Phe Leu Leu Thr Lys Lys Phe Arg Leu Ile Thr Val Arg Lys
260 265 270

Ile Ala Thr Ile Leu Gly Ser Leu Pro Ser Ser Ala Leu Ile Val Ser
275 280 285

Leu Pro Tyr Leu Asn Ser Gly Tyr Ile Thr Ala Thr Ala Leu Leu Thr
290 295 300

Leu Ser Cys Gly Leu Ser Thr Leu Cys Gln Ser Gly Ile Tyr Ile Asn
305 310 315 320

Val Leu Asp Ile Ala Pro Arg Tyr Ser Ser Phe Leu Met Gly Ala Ser
325 330 335

Arg Gly Phe Ser Ser Ile Ala Pro Val Ile Val Pro Thr Val Ser Gly
340 345 350

Phe Leu Leu Ser Gln Asp Pro Glu Phe Gly Trp Arg Asn Val Phe Phe
355 360 365

Leu Leu Phe Ala Val Asn Leu Leu Gly Leu Leu Phe Tyr Leu Ile Phe
370 375 380

Gly Glu Ala Asp Val Gln Glu Trp Ala Lys Glu Arg Lys Leu Thr Arg
385 390 395 400



(2) INFORMATION FOR SEQ ID NO:3:



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 467 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

Met Gln Met Asp Asn Arg Leu Pro Pro Lys Lys Val Pro Gly Phe Cys
1 5 10 15

Ser Phe Arg Tyr Gly Leu Ser Phe Leu Val His Cys Cys Asn Val Ile
20 25 30

Ile Thr Ala Gln Arg Ala Cys Leu Asn Leu Thr Met Val Val Met Val
35 40 45

Asn Ser Thr Asp Pro His Gly Leu Pro Asn Thr Ser Thr Lys Lys Leu
50 55 60

Leu Asp Asn Ile Lys Asn Pro Met Tyr Asn Trp Ser Pro Asp Ile Gln
65 70 75 80

Gly Ile Ile Leu Ser Ser Thr Ser Tyr Gly Val Ile Ile Ile Gln Val
85 90 95

Pro Val Gly Tyr Phe Ser Gly Ile Tyr Ser Thr Lys Lys Met Ile Gly
100 105 110

Phe Ala Leu Cys Leu Ser Ser Val Leu Ser Leu Leu Ile Pro Pro Ala
115 120 125

Ala Gly Ile Gly Val Ala Trp Val Val Val Cys Arg Ala Val Gln Gly
130 135 140

Ala Ala Gln Gly Ile Val Ala Thr Ala Gln Phe Glu Ile Tyr Val Lys
145 150 155 160

Trp Ala Pro Pro Leu Glu Arg Gly Arg Leu Thr Ser Met Ser Thr Ser
165 170 175

Gly Phe Leu Leu Gly Pro Phe Ile Val Leu Leu Val Thr Gly Val Ile
180 185 190

Cys Glu Ser Leu Gly Trp Pro Met Val Phe Tyr Ile Phe Gly Ala Cys
195 200 205

Gly Cys Ala Val Cys Leu Leu Trp Phe Val Leu Phe Tyr Asp Asp Pro
210 215 220

Lys Asp His Pro Cys Ile Ser Ile Ser Glu Lys Glu Tyr Ile Thr Ser
225 230 235 240

Ser Leu Val Gln Gln Val Ser Ser Ser Arg Gln Ser Leu Pro Ile Lys
245 250 255

Ala Ile Leu Lys Ser Leu Pro Val Trp Ala Ile Ser Ile Gly Ser Phe
260 265 270

Thr Phe Phe Trp Ser His Asn Ile Met Thr Leu Tyr Thr Pro Met Phe
275 280 285

Ile Asn Ser Met Leu His Val Asn Ile Lys Glu Asn Gly Phe Leu Ser
290 295 300

Ser Leu Pro Tyr Leu Phe Ala Trp Ile Cys Gly Asn Leu Ala Gly Gln
305 310 315 320

Leu Ser Asp Phe Phe Leu Thr Arg Asn Ile Leu Ser Val Ile Ala Val
325 330 335

Arg Lys Leu Phe Thr Ala Ala Gly Phe Leu Leu Pro Ala Ile Phe Gly
340 345 350

Val Cys Leu Pro Tyr Leu Ser Ser Thr Phe Tyr Ser Ile Val Ile Phe
355 360 365

Leu Ile Leu Ala Gly Ala Thr Gly Ser Phe Cys Leu Gly Gly Val Phe
370 375 380

Ile Asn Gly Leu Asp Ile Ala Pro Arg Tyr Phe Gly Phe Ile Lys Ala
385 390 395 400

Cys Ser Thr Leu Thr Gly Met Ile Gly Gly Leu Ile Ala Ser Thr Leu
405 410 415

Thr Gly Leu Ile Leu Lys Gln Asp Pro Glu Ser Ala Trp Phe Lys Thr
420 425 430

Phe Ile Leu Met Ala Ala Ile Asn Val Thr Gly Leu Ile Phe Tyr Leu
435 440 445

Ile Val Ala Thr Ala Glu Ile Gln Asp Trp Ala Lys Glu Lys Gln His
450 455 460

Thr Arg Leu
465

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

Met Glu Phe Arg Gln Glu Glu Phe Arg Lys Leu Ala Gly Arg Ala Leu
1 5 10 15

Gly Arg Leu His Arg Leu Leu Glu Lys Arg Gln Glu Gly Ala Glu Thr
20 25 30

Leu Glu Leu Ser Ala Asp Gly Arg Pro Val Thr Thr His Thr Arg Asp
35 40 45

Pro Pro Val Val Asp Cys Thr Cys Phe Gly Leu Pro Arg Arg Tyr Ile
50 55 60

Ile Ala Ile Met Ser Gly Leu Gly Phe Cys Ile Ser Phe Gly Ile Arg
65 70 75 80

Cys Asn Leu Gly Val Ala Ile Val Ser Met Val Asn Asn Ser Thr Thr
85 90 95

His Arg Gly Gly His Val Val Val Gln Lys Ala Gln Phe Asn Trp Asp
100 105 110

Pro Glu Thr Val Gly Leu Ile His Gly Ser Phe Phe Trp Gly Tyr Ile
115 120 125

Val Thr Gln Ile Pro Gly Gly Phe Ile Cys Gln Lys Phe Ala Ala Asn
130 135 140

Arg Val Phe Gly Phe Ala Ile Val Ala Thr Ser Thr Leu Asn Met Leu
145 150 155 160

Ile Pro Ser Ala Ala Arg Val His Tyr Gly Cys Val Ile Phe Val Arg
165 170 175

Ile Leu Gln Gly Leu Val Glu Gly Val Thr Tyr Pro Ala Cys His Gly
180 185 190

Ile Trp Ser Lys Trp Ala Pro Pro Leu Glu Arg Ser Arg Leu Ala Thr
195 200 205

Thr Ala Phe Cys Gly Ser Tyr Ala Gly Ala Val Val Ala Met Pro Leu
210 215 220

Ala Gly Val Leu Val Gln Tyr Ser Gly Trp Ser Ser Val Phe Tyr Val
225 230 235 240

Tyr Gly Ser Phe Gly Ile Phe Trp Tyr Leu Phe Trp Leu Leu Val Ser
245 250 255

Tyr Glu Ser Pro Ala Leu His Pro Ser Ile Ser Glu Glu Glu Arg Lys
260 265 270

Tyr Ile Glu Asp Ala Ile Gly Glu Ser Ala Lys Leu Met Asn Pro Val
275 280 285

Thr Lys Phe Asn Thr Pro Trp Arg Arg Phe Phe Thr Ser Met Pro Val
290 295 300

Tyr Ala Ile Ile Val Ala Asn Phe Cys Arg Ser Trp Thr Phe Tyr Leu
305 310 315 320

Leu Leu Ile Ser Gln Pro Ala Tyr Phe Glu Glu Val Phe Gly Phe Glu
325 330 335

Ile Ser Lys Val Gly Leu Val Ser Ala Leu Pro His Leu Val Met Thr
340 345 350

Ile Ile Val Pro Ile Gly Gly Gln Ile Ala Asp Phe Leu Arg Ser Arg
355 360 365

His Ile Met Ser Thr Thr Asn Val Arg Lys Leu Met Asn Cys Gly Gly
370 375 380

Phe Gly Met Glu Ala Thr Leu Leu Leu Val Val Gly Tyr Ser His Ser
385 390 395 400

Lys Gly Val Ala Ile Ser Phe Leu Val Leu Ala Val Gly Phe Ser Gly
405 410 415

Phe Ala Ile Ser Gly Phe Asn Val Asn His Leu Asp Ile Ala Pro Arg
420 425 430

Tyr Ala Ser Ile Leu Met Gly Ile Ser Asn Gly Val Gly Thr Leu Ser
435 440 445

Gly Met Val Cys Pro Ile Ile Val Gly Ala Met Thr Lys His Lys Thr
450 455 460

Arg Glu Glu Trp Gln Tyr Val Phe Leu Ile Ala Ser Leu Val His Tyr
465 470 475 480

Gly Gly Val Ile Phe Tyr Gly Val Phe Ala Ser Gly Glu Lys Gln Pro
485 490 495

Trp Ala Glu Pro Glu Glu Met Ser Glu Glu Lys Cys Gly Phe Val Gly
500 505 510

His Asp Gln Leu Ala Gly Ser Asp Glu Ser Glu Met Glu Asp Glu Val
515 520 525

Glu Pro Pro Gly Ala Pro Pro Ala Pro Pro Pro Ser Tyr Gly Ala Thr
530 535 540

His Ser Thr Val Gln Pro Pro Arg Pro Pro Pro Pro Val Arg Asp Tyr
545 550 555 560

(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 272 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

ATTTATATCA ATGTCTTAGA TATTGCTCCA AGGTATTCCA GTTTTCTCAT GGGAGCATCA 60

AGAGGATTTT CGAGCATAGC ACCTGTCATT GTACCCACTG TCAGTGGATT TCTTCTTAGT 120

CAGGACCCTG AGTTTGGGTG GAGGAATGTC TTCTTCTTGC TGTTTGCCGT TAACCTGTTA 180

GGACTACTCT TCTACCTCAT ATTTGGAGAA GCAGATGTCC AAGAATGGGC TAAAGAGAGA 240

AAACTCACTC GTTTATGAAG TTATCCCACC TT 272
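
As a consistency check on the nucleotide entry above, the following Python sketch joins the five printed rows of SEQ ID NO:5, drops all whitespace inside and between the ten-base groups, and compares the running base count with the counter printed at the end of each row and with the declared LENGTH. The helper name `parse_nucleotide_rows` is hypothetical and is not part of FastSEQ or of any tool named in this listing. The rows are copied from the listing, and because all whitespace is ignored, stray spaces inside a group do not affect the result.

```python
import re

# Rows of SEQ ID NO:5 as printed in the listing: ten-base groups followed by
# a running position counter at the end of each row.
SEQ_ID_NO_5_ROWS = [
    "ATTTATATCA ATGTCTTAGA TATTGCTCCA AGGTATTCCA GTTTTCTCAT GGGAGCATCA 60",
    "AGAGGATTTT CGAGCATAGC ACCTGTCATT GTACCCACTG TCAGTGGATT TCTTCTTAGT 120",
    "CAGGACCCTG AGTTTGGGTG GAGGAATGTC TTCTTCTTGC TGTTTGCCGT TAACCTGTTA 180",
    "GGACTACTCT TCTACCTCAT ATTTGGAGAA GCAGATGTCC AAGAATGGGC TAAAGAGAGA 240",
    "AAACTCACTC GTTTATGAAG TTATCCCACC TT 272",
]


def parse_nucleotide_rows(rows):
    """Hypothetical helper: join listing rows into one base string.

    Returns (sequence, checks), where each entry of checks pairs the number
    of bases counted so far with the counter printed on that row.
    """
    sequence = ""
    checks = []
    for row in rows:
        # Bases (with optional spaces), then whitespace, then the counter.
        match = re.fullmatch(r"\s*([ACGT\s]+?)\s+(\d+)\s*", row)
        if match is None:
            raise ValueError(f"unrecognised row: {row!r}")
        sequence += re.sub(r"\s+", "", match.group(1))
        checks.append((len(sequence), int(match.group(2))))
    return sequence, checks


sequence, checks = parse_nucleotide_rows(SEQ_ID_NO_5_ROWS)
print(len(sequence))  # 272, matching the declared LENGTH
print(all(counted == printed for counted, printed in checks))  # True
```

The same check applies to any row-numbered nucleotide entry. A mismatch between the counted and printed positions points to a dropped or duplicated base in transcription.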

(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

Cys Thr Thr Gly Ala Thr Gly Cys Thr Cys Cys Cys Ala Thr Gly Ala
1 5 10 15

Gly Ala Ala Ala Ala Cys Thr Gly Gly
20 25

(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

Ala Gly Gly Ala Thr Thr Thr Thr Cys Gly Ala Gly Cys Ala Thr Ala
1 5 10 15

Gly Cys Ala Cys Cys Thr Gly Thr Cys
20 25
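
Every residue in SEQ ID NO:6 and SEQ ID NO:7 is Ala, Cys, Gly or Thr, whose one-letter codes (A, C, G, T) coincide with the four DNA base symbols. The listing types both entries as amino acid, so the Python sketch below is only an exploratory check and not a correction of the listing. It converts the two entries to one-letter strings and looks for each string, or its reverse complement, within SEQ ID NO:5. The function names `to_letters`, `reverse_complement` and `locate` are hypothetical helpers introduced for this illustration.

```python
# One-letter codes for the four residues used in SEQ ID NO:6 and SEQ ID NO:7.
ONE_LETTER = {"Ala": "A", "Cys": "C", "Gly": "G", "Thr": "T"}
# Base pairing, used only under the assumption that the letters are read as DNA.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

# SEQ ID NO:5 from the listing, with group spacing and row counters removed.
SEQ_ID_NO_5 = "".join("""
ATTTATATCA ATGTCTTAGA TATTGCTCCA AGGTATTCCA GTTTTCTCAT GGGAGCATCA
AGAGGATTTT CGAGCATAGC ACCTGTCATT GTACCCACTG TCAGTGGATT TCTTCTTAGT
CAGGACCCTG AGTTTGGGTG GAGGAATGTC TTCTTCTTGC TGTTTGCCGT TAACCTGTTA
GGACTACTCT TCTACCTCAT ATTTGGAGAA GCAGATGTCC AAGAATGGGC TAAAGAGAGA
AAACTCACTC GTTTATGAAG TTATCCCACC TT
""".split())

SEQ_ID_NO_6 = ("Cys Thr Thr Gly Ala Thr Gly Cys Thr Cys Cys Cys Ala Thr Gly Ala "
               "Gly Ala Ala Ala Ala Cys Thr Gly Gly")
SEQ_ID_NO_7 = ("Ala Gly Gly Ala Thr Thr Thr Thr Cys Gly Ala Gly Cys Ala Thr Ala "
               "Gly Cys Ala Cys Cys Thr Gly Thr Cys")


def to_letters(three_letter):
    """Convert space-separated three-letter codes to a one-letter string."""
    return "".join(ONE_LETTER[token] for token in three_letter.split())


def reverse_complement(bases):
    """Reverse complement, treating the letters as DNA bases (an assumption)."""
    return "".join(COMPLEMENT[base] for base in reversed(bases))


def locate(query, target):
    """Return the 1-based start of query in target, or None if absent."""
    index = target.find(query)
    return None if index < 0 else index + 1


seq6 = to_letters(SEQ_ID_NO_6)
seq7 = to_letters(SEQ_ID_NO_7)
print(len(seq6), len(seq7))                           # 25 25
print(locate(seq7, SEQ_ID_NO_5))                      # 63
print(locate(reverse_complement(seq6), SEQ_ID_NO_5))  # 38
```

Read this way, SEQ ID NO:7 matches positions 63 to 87 of SEQ ID NO:5 directly, and the reverse complement of SEQ ID NO:6 matches positions 38 to 62. The listing does not state this relationship.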